Influence of Reverberation on Automatic Evaluation of Intelligibility with Prosodic Features

نویسندگان

  • Tino Haderlein
  • Michael Döllinger
  • Anne Schützenberger
  • Elmar Nöth
چکیده

Objective analysis of intelligibility by a speech recognizer and prosodic features was performed for close-talking recordings before. This study examined whether this is also possible for reverberated speech. In order to ensure that only the room acoustics are different, artificial reverberation was used. 82 patients after partial laryngectomy read a standardized text, 5 experienced raters assessed intelligibility perceptually on a 5-point scale. The best feature subset, determined by Support Vector Regression, consists of the word correctness of a speech recognizer, the average duration of silent pauses, the standard deviation of the F0 on the entire sample, the standard deviation of jitter, and the ratio of the durations of the voiced sections and the entire recording. A human-machine correlation of r = 0.80 was achieved for the close-talking recordings and r = 0.72 for the worst case of the examined signal qualities. By adding three more features, also r = 0.80 was reached for the reverberated scenario.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Analysis for Automatic Evaluation of Shadowing

This paper presents acoustic analysis for the purpose of automatic evaluation of shadowing speech. We use selfchecked scores of understanding, manual prosodic scores, and TOEIC scores as reference scores of learners’ shadowing speech, and compare these scores with automatic scores based on acoustic features that can reflect phoneme intelligibility and prosodic fluency in terms of intonation, an...

متن کامل

Robust Automatic Evaluation of Intelligibility in Voice Rehabilitation Using Prosodic Analysis

Speech intelligibility for voice rehabilitation has been successfully evaluated by automatic prosodic analysis. In this paper, the influence of reading errors and the selection of certain words for the computation of prosodic features (nouns only, nouns and verbs, beginning of each sentence, beginnings of sentences and subclauses) are examined. 73 hoarse patients (48.3± 16.8 years) read the Ger...

متن کامل

Influence of Reading Errors on the Text-Based Automatic Evaluation of Pathologic Voices

In speech therapy and rehabilitation, a patient’s voice has to be evaluated by the therapist. Established methods for objective, automatic evaluation analyze only recordings of sustained vowels. However, an isolated vowel does not reflect a real communication situation. In this paper, a speech recognition system and a prosody module are used to analyze a text that was read out by the patients. ...

متن کامل

Assessment of Non-native Prosody for Spanish as L2 using quantitative scores and perceptual evaluation

In this work we present SAMPLE, a new pronunciation database of Spanish as L2, and first results on the automatic assessment of Nonnative prosody. Listen and repeat and read tasks are carried out by native and foreign speakers of Spanish. The corpus has been designed to support comparative studies and evaluation of automatic pronunciation error assessment both at phonetic and prosodic level. Fo...

متن کامل

Intelligibility Rating with Automatic Speech Recognition, Prosodic, and Cepstral Evaluation

For voice rehabilitation, speech intelligibility is an important criterion. Automatic evaluation of intelligibility has been shown to be successful for automatic speech recognition methods combined with prosodic analysis. In this paper, this method is extended by using measures based on the Cepstral Peak Prominence (CPP). 73 hoarse patients (48.3± 16.8 years) uttered the vowel /e/ and read the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016